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Abstract 

In this note we employ methods borrowed from spin glass theory to study 
the phase space structure of fields in an inflating universe. In particular, we 
compute the overlap distribution of a suitably coarse-grained, massless scalar 
on a 1 + 1 dimensional (hence baby) de Sitter background, and find that (after 
an appropriate shift and rescaling) it is given by a Gumbel distribution. We 
also calculate the triple overlap distribution of this system, whose characteristic 
function turns out to be a product of two Gumbel factors. 
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1 Introduction 



We live in an expanding universe. There is very good evidence that this expansion is in 
fact accelerating [1, 2], and the central idea of the theory of inflation [3, 4, 5] is that in the 
distant past, for a period of time not long after the big bang, this behavior was a dominant 
feature of our universe. 

During such periods of accelerated expansion, different regions of the universe lose causal 
contact with each other. A distant star we see in the sky one night might not be visible 
anymore the following night, not because it had stopped shining, but simply because the 
light it emits will never reach us. 

On the other hand, we are used to thinking of physics in terms of local quantities and 
fields, which have correlators that fall off rapidly with distance. Let us consider an inflating 
universe endowed with a scalar field living in it. If this field is massless, its expectation value 
may vary widely from place to place. 1 Clearly, the field locally wants to minimize its energy 
density by not allowing steep gradients, but even small gradients can lead to vastly different 
expectation values over large distances. Once a neighboring region disappears behind our 
cosmo logical horizon, and we can no longer communicate with it, what is to stop the field 
in this region from drifting towards an expectation value completely different from what we 
measure in our vicinity? Thus small quantum fluctuations may be amplified into a wildly 
varying field profile. 

One might argue that a local observer should not care about expectation values in regions 
beyond his horizon. But now let's take a more global point of view. Imagine an observer 
that looks back at these disconnected regions after inflation has ended (when formerly out of 
contact regions are becoming accessible again) and samples the average expectation values of 
the quantum field in each of those regions. What is the distribution of these coarse-grained 
expectation values (and conjugate momenta) that such an observer would measure? 

Questions of this nature have been studied for a long time in the context of cosmological 
perturbation theory and structure formation [6, 7]. There is a large body of work seeking to 
predict what spectrum of matter density fluctuations a post-inflation observer should observe 
in the sky, assuming they arise from primordial quantum fluctuations and are modified 
according to the details of one's favorite inflationary model. 

But here we would like to take a further step back, and consider the statistical proper- 
ties of these coarse-grained variables in phase space. Given the thermal nature of the de 
Sitter vacuum, is there ergodicity breaking? Does this system exhibit a glassy structure, as 
suggested in [8]? 

We would like to shed some light on the global structure of the reduced phase space of 
expectation values (and conjugate momenta) corresponding to causally disconnected regions 
on a constant time slice. To this end, we will make use of tools and techniques from spin 
glass theory (see e.g. [9, 10, 11]), in particular the overlap and triple overlap distributions, 
which can be understood as order parameters for a non-trivial structure in the space of 

1 Needless to say, there are no such massless scalars with significant couplings to the standard model 
particles in our universe today, else we would have noticed their profound consequences for low-energy 
physics (though almost massless scalar fields do occur e.g. in models of slow roll inflation). Thus we should 
perhaps not think of this field as matter, but rather as an abstract probe of the properties of dc Sitter space 
itself. 
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thermodynamic states [12]. 

The application of spin glass methods to de Sitter dynamics was pioneered by F. Denef 
and D. Anninos [8, 13, 14, 15], who conceived the central ideas and were the first to investigate 
overlaps of various types on de Sitter spaces of different dimensionalities. They developed the 
formalism of reduced density matrix based overlaps that we will employ, and also wrote down 
the simple model that we will study, as an ideal playground to both test these techniques 
and attempt to tackle questions about the state space of de Sitter. In Section 2 we will 
review the relevant parts of their work, and remind the reader of some simple definitions 
and statistical notions borrowed from the theory of spin glasses. 

Section 3 describes a direct attempt to characterize the overlap, which will give us the first 
few moments of the distribution, while in Section 4 we introduce a slightly more sophisticated 
method, based on graphs and combinatorics, that allows us to find the complete distribution 
(at leading order in a late-time expansion). In Section 5 we generalize this method to compute 
the triple overlap distribution, before concluding with a brief discussion of our results. 



2 Setup and Review 
2.1 A Simple Model 

The model we will consider [15] could hardly be any simpler. As a background will take a 
1+1 dimensional de Sitter spacetime, whose metric we can write in global coordinates as 

ds 2 = -dr 2 + cosh 2 rd# 2 , (1) 

where we have set the cosmological constant to unity. A fixed time r = const, slice is then 
simply a circle parametrized by the angular variable 9 e (0, 2ir). On this background we 
study the evolution of a real, massless scalar field 0(r, #). For a constant time slice we can 
Fourier decompose its fluctuations in the obvious fashion 2 

oo 

= E 4>* , (2) 

n=— oo 

where reality requires that 0_„ = </>*. The Euclidean vacuum wavefunctional [16, 17] can 
then be written as 

OO Icy 

* = nv-^H^i 2 ) ' (3) 

n=l » n 

which simply takes the form of an infinite sequence of harmonic oscillators (and we have 
removed the zero mode). 3 

All this is very elementary and its seems hard to believe that anything interesting could 
come out of it. However, as we have remarked above, a local observer only has access to his 
own causal diamond, and thus cannot measure all of the Fourier modes of the scalar field. 

2 We have chosen a closed slicing here in order to obtain discrete modes. 
3 This is closely related to the string world-sheet vacuum studied in [18]. 
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The Penrose diagram of de Sitter is of course simply a rectangle, and the closer we are 
to future space-like infinity (the top of the diagram) the more causally disconnected regions 
there are on a constant r (which implies also constant conformal time) slice. Let's consider 
such a slice with 0(1/ e) regions of typical size e. Our future observer that looks back long 
after inflation has ended can in principle measure the average state (expectation value and 
momentum) inside all of these boxes 4 , and may choose to introduce corresponding coarse 
grained variables defined by convoluting the field profile with a suitable window function. 

Let us choose for this purpose a Lorentzian window (or box) function centered at 9i 




since its Fourier transform is conveniently given by an exponential, and thus the average 
value of the field inside this box is given by the convolution 

/oo °° 
d6 Bi{6 - 0i) <t>{6) = E e in *e- ne / 2 0„ . (5) 

Here we have taken the range of integration as infinite, which is an approximation that 
becomes exact in the limit of vanishing e which we will be interested in below (or alternatively 
we can think of the integral as simply wrapping around the circle). 

Note that while the vacuum (3) is a pure state in the full Hilbert space, this is no longer 
true in the reduced state space resulting from coarse-graining. What we are going to be 
concerned with are the statistical properties of these coarse-grained variables, and it is their 
overlap distribution we would like to compute. First, however, we will have to define this 
distribution and introduce some techniques for calculating it. 



2.2 Review of Overlap Distributions 

Let us briefly recall the basic concept of an overlap distribution and some useful definitions, 
as explained in [9, 10, 11, 12] and the excellent recent lecture notes [13]. 

In complex materials a thermodynamic state described by a density matrix p may at 
low temperatures break ergodicity (i.e. not all accessible microstates or phase space cells 
will be equally likely). This can occur as a trivial consequence of symmetry breaking (as 
in a ferromagnet), but in more non-trivial systems such as spin glasses ergodicity breaking 
takes place irrespective of any obvious associated symmetry. We should then be able to 
decompose the density matrix into different ergodic components (sometimes referred to as 
thermodynamic pure states) p = J2a w aPa- 

Different ergodic components in phase space are separated by very high free energy 
barriers, and thus no longer communicate with each other at low temperatures. The overlap 

4 In the case of a strict de Sitter universe inflation of course never ends, but we could imagine an unphysical, 
omniscient being that has access to the expectation values of the field inside all the causally disconnected 
regions on a space-like hypersurface. This latter point of view is of course not just unphysical in that no 
such observer exists, but it also opens up a Pandora's box of dicey questions concerning the interpretation 
of quantum mechanics in the context of cosmology. We will not discuss such issues here, so our discussion of 
de Sitter should be understood merely as an idealization approximating the post-inflation observer's point 
of view. 



4 



q a p between two such components is then defined by averaging over all degrees of freedom 
the product of their expectation values in states a and j3. E.g. for a system of N spins Sj, 
we would write q af3 = ± T,i(si) a (si) p. 

Usually it is not possible to explicitly construct the decomposition into thermodynamic 
pure states, but a quantity of interest that may be computable is the overlap distribution 

V(q) = ^w a w p S{q-q a p) = Ulq-^^2s}sA\ . (6) 

a,P \ V i J I 2 replicas 

The last equality is a rather non-trivial step, that forms the centerpiece of the replica formal- 
ism. It re-expresses the sum over ergodic components (which we cannot perform explicitly) 
as an expectation value of an overlap between two copies of the system. We will not re- 
produce the proof of this statement here, but instead refer the reader to the literature for 
justification [9, 13]. 5 

As we keep lowering the temperature, the breaking up of the phase space may continue 
in several steps, leading naturally to a tree structure in the space of states. This is best 
probed by computing the triple overlap distribution 

P(9i,92,53) = w aWpW 1 5(q 1 -q^)5(q 2 -q ja )5(q 3 -q a / 3 ) , (7) 

which can be similarly expressed in terms of a three replica expectation value. If a hierar- 
chical tree structure is present in the space of states it will be reflected in this quantity ex- 
hibiting ultrametricity [19], namely the property that out of any triple of overlaps (qi, q 2 , q 3 ) 
the largest two will be equal, as is the case for pairwise path-distances between any three 
leaves of a tree. This means that the distribution will have support only on a codimension 
one subspace where the three q a form equilateral or isosceles triangles. 

2.3 Introducing Wigner Densities 

Now let's consider more generally N degrees of freedom living in a Hilbert space H, and 
denote by %i the Hilbert space associated with one of them. We can find a reduced density 
matrix p l by tracing over all other degrees of freedom, and use this to define a new kind 
of overlap 6 , first introduced in [13], between two states labelled by a and (3 as Q a p = 
jf Ej Tr % PaPp- 

We can recast this overlap in a very suggestive form by working with Wigner distributions 
instead of density matrices. Recall that the Wigner distribution in phase space is defined as 
a Fourier transform of certain matrix elements of the density operator. In the case at hand, 
for a scalar field with momentum p, in the Euclidean vacuum state p — |^ f )(^ r |, we find 

/oo 
ds e is - p (0 + s/2|p|0 - s/2) = J] 4 e - 2n ^ 2 ~\ p -\ 2/{2n) , (8) 

n=l 

5 Note that there is nothing unphysical about working with two copies of the same system, and that this 
should not be confused with the so-called replica trick for computing quenched averages of the free energy, 
which employs an arbitrary number of replicas and then analytically continues that number to zero. 

6 We use the upper case symbol Q a p here to stress the analogy with q a p in the spin example, but note 
that the range in which this overlap takes values is shifted and rescaled compared to the definition above. 
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where s and p satisfy the same reality condition as 0, and the integral measure is ds = 
II^Li ds n , with the s n integrated over the whole complex plane. Here we have set Planck's 
constant to unity and used the shorthand s.p = | X^L-oc s nPn, again with the zero-mode 
removed. 

We can also write down the Wigner densities W l corresponding to the reduced density 
matrices for the average expectation value Xi and conjugate momentum P of the field inside 
a certain region Bi [15] 

w l = J d<j>dps(xi- J™ deB^e-e^m) & -/_" d9B0 -9 i ) P (9)^ w(<t>,p) , (9) 

where the integration measure is (2n)~ 2 EKLi d0 n dp n and the range again extends over the 
whole complex plane. 7 

Using these reduced Wigner distributions on the phase space for one particular degree of 
freedom, we can recast the overlap in the following form [13] 

Q°f> = £ / dP * W a( X i, P i) W H X i, P i) > ( 10 ) 

except that in the calculations below we will not actually work with N fixed boxes in positions 
9i and average over them, but instead replace this sum by an integral (2ir)~ 1 J" 27r d#j. 

This expression for the overlap can now be used to rewrite the overlap distribution as a 
two replica expectation value, in complete analogy with the right hand side of (6). While an 
expectation value of a delta function may appear to be a somewhat singular representation, 
the important implication is that the moments of the distribution [15] are given by 

where \y 123 '" K is a multivariable Wigner density defined exactly as in (9) except with K pairs 
of delta function insertions that set the average expectation values of the field (f)(9) inside 
the boxes at positions 9i equal to Xi and similarly equate the average conjugate momenta 
to Pi. 

For our particular choice of window function (4) we can find the reduced Wigner distri- 
bution by using the integral representation of the K pairs of (^-functions with integration 
variables \ for expectation values and \ for momenta, which gives 



W 



K 

123.. .K 



J°° (j{ dA fe dA fc e 2-(A fc x fc+ A fc p fc )^ (12 ) 



x exp [ -47r 3 e ]T e~ nt ]T ( + 2nA i A j 1 cosn(^ - 6j) 



™=1 »J=1 



Provided we can compute from (11) the moments for all integers K, we can find the 
characteristic function, and hence reconstruct the full overlap distribution V(q). If the 



7 The coarse-graining involved in the definition of the reduced Wigner densities reflects the thermal nature 
of the cosmological horizon of a localized observer; in particular, the associated density matrices no longer 
correspond to pure quantum states such as the Euclidean vacuum state (3) we stated with. 
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system has a non-trivial phase structure, which is related to failure of cluster decomposition 
as discussed in [13], we should find a non-trivial overlap distribution. In this sense this 
quantity can be understood as an order parameter for ergodicity breaking. 



3 A First Stab at the Overlap Distribution 

Having explained the basic setup of the computation, let us now tackle the calculation of 
the first few moments of the overlap distribution. From the analogy with complex materials 
in the previous section, it should be apparent that the box size e plays a role analogous to 
temperature, and therefore we will be interested in the small e (i.e. late conformal time) 
expansion in which there are many causally disconnected regions in our model universe. 



3.1 First Moment 

In order to find the first moment of the overlap distribution, we merely have to simplify the 
expression (12) for the reduced Wigner density with one index. There are no angular integrals 
to be performed (since by rotational symmetry the moments only depend on differences of 
angles), and the integrals over X\ and Pi simply lead to delta functions that set to zero the 
sums of the A« and \ parameters, respectively, for the two copies of the Wigner density. 
Evaluating the sums En=i 2ne ~ ne = (coshe - l)^ 1 and E~=i e _ne /(2n) = -§log(l - e~ e ) 
we find that the first moment is given by 

(q) = J dXidPi (W 1 ) 2 = j°° dAidAi exp ^-8n 3 e ^-^ log(l - e 
sinh 6 



2 



^ +Q(e(-loge)- 3 / 2 ) 




47r 2 ev /-log(l -e~ e ) 87rV-k>ge 

Note that the leading term is proportional to (— log e) -1 / 2 , which tells us that the average 
overlap goes to zero (albeit rather slowly) with conformal time. The precise coefficient of (q) 
depends our choice of window function (i.e. on how many boxes of a given shape and width 
e can effectively fit into our one-dimensional universe), and thus is not particularly physical, 
though it will be crucial when comparing to higher moments of the distribution. 



3.2 Second Moment 

To compute higher moments we have to perform angular integrals over the (differences in) 
box positions. For K = 2 there is only one independent angle (since one can always be fixed 
using the rotational symmetry) and the integral can be evaluated exactly. 
Using the sums 



JS, nc . A „. cosh e cos A9 — 1 . . 

Y In e~ ne cos (nA0) = — , 14 

^ v 1 cosh e - cos A6 v 1 

oo -rie i 

-s— cos(nAtf) = -- log(l - 2e- e cos A# + e^) , (15) 

n=l 2n 4 
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we find that the second moment is given by 

/ dX 1 dP 1 dX 2 dP 2 (W 12 ) 2 = \ „ - 1 - , (16) 

J 1 ^ 647T 4 e 2 v ^etM v / detM 

where the symmetric matrix M has entries given by equation (14) with A6 = — 9j, 
and similarly the symmetric matrix M has matrix elements given by equation (15). 8 

Due to the smoothing provided by the window function, all matrix elements are finite, 
and divergences can only occur when one of the matrices develops a zero eigenvalue (such 
that the determinant vanishes) which happens when the two boxes coincide. This singularity 
is not integrable, thus we have to be careful to exclude from the integral a region in which 
the angular separation becomes less than 0(e). 9 

It is not hard to see that the momentum determinant can be expanded in a power series 

in e 

' ' 2 +0(e 4 ). (17) 



VdetM 2 

Furthermore, even though subleading terms in this expression diverge for vanishing angular 
separation A9 = 0, after performing the angular integral over the interval (e, 27r — e), say, the 
contribution of the momentum determinant remains e 2 /2 to leading order. Thus we have to 
focus on the position determinant, and we will see below that this behavior continues for the 
computation of higher moments: the momentum determinant contributes only a constant, 
since the momentum correlators are short-ranged. 

It is convenient to rescale the parameters A, by (— 47r 3 elog(l — e^)) -1 / 2 so that the 
diagonal terms become e independent Gaussians, and 

(q ) = — / / , , , — (l-2e- e cosA# + e- 2e )- log(1 - e > . 18 

W/ 8n 2 e2J 2tt J-oc -47T 3 e log(l - e~ e ) V > V ; 

We are now supposed to integrate over the parameters Aj, and in the end over the angle 
A0, being careful to exclude a small region around A6 = 0. This is not very practical 
however, and we will instead switch the order of integration. 



8 Wc should remark at this point that in principle any reasonable (i.e. continuous and square integrable) 
window function could have done do the job instead of (4), and suitably regularized the computation of 
the matrix elements of M and M. However, the Lorentzian is particularly convenient, since it allows us 
to explicitly perform the resulting sums (14) and (15) which makes the following computations much more 
manageable. We have tried out different box functions, and found that the crucial long-range behavior 
of (15) was identical, but accompanied by various short range terms modifying this for small AO. If we 
grant that the long range behavior is universal, the detailed choice of box function should not matter for 
subsequent calculations of the shape of the (appropriately rescaled) distribution in the small e limit. 

9 A different method to treat coincident boxes, which may be conceptually clearer, is as follows. Instead 
of defining the the reduced Wigner distribution with an insertion of S(Xi — J d6Bi(j>(9)), we can broaden the 
delta function to a Gaussian of width S, i.e. instead of demanding that the averaged field be exactly equal 
to a specified value, we only ask for approximate (smeared) agreement. It is easy see that this ensures that 
matrix M never degenerates, i.e. has non-vanishing determinant even for coincident boxes. The same is true 
for the matrix M with the corresponding broadening of the momentum space delta function. We should 
then compute the moments of the overlap distribution as functions of e and S and in the end take the double 
limit of both small parameters going to zero. Unfortunately this makes the calculation somewhat unwieldy 
in practice. 
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Performing the angular integral first, over the full range (0, 2ir), which in this case can be 
done explicitly, leads to a hypergeometric function depending on e and the combination c = 
— AiA2/log(l — e~ e ) appearing in the exponent above. Of course, if we then try to integrate 
over the parameters Aj we again encounter the singularity originating from coincident boxes. 
For very negative values of the exponent c the hypergeometric function grows so fast that 
the Gaussian envelope in (18) fails to render the integral finite. 

It would appear that we have not gained anything, and still have to introduce a regu- 
lator to obtain a well-defined answer. However, at this point we recall that we are really 
interested in the thermodynamic limit, which in this case means the limit of small e. At late 
cosmological times, the (causally disconnected) boxes become very small, and two randomly 
chosen ones are very unlikely to be coincident. 

Taking the small e limit after carrying out the angular integral in (18) allows us to 
separate the divergent part of the answer from the C(e°(log e) n ) terms that we are interested 



in. 10 



Note that of course one cannot simply set e to zero from the start. It is crucial to extract 
the dependence on loge first, which is most easily done by rescaling the \ appropriately. 
In fact, going back to (18) one can check that performing the integral for finite e and then 
taking the thermodynamic limit is equivalent to letting e go to zero in the integrand, such 
that it appears only inside inside the logarithms, and then performing the integral. 

This leads to an expansion of the second moment in inverse powers of log e 

/ ax 1 ( 71-2 19?r4 1375tt 6 + 151200C 2 (3) 8 A 

<" > = =64^ ( 1 + 24l^ + 640togV + 21504 lo? / + ° ( '° g ^ ) ' (19) 

Higher order terms are easily computed by expanding further. Note that this expansion 
satisfies a transcendentality principle; each inverse power of log e comes with a power of 7r 
(or corresponding ( value). Also, the leading term is precisely equal to {q) 2 - 

In summary, our prescription for computing the overlap, which we will also apply to higher 
moments below, is to first rescale the auxiliary parameters Aj and take the thermodynamic 
limit, throwing away terms proportional to powers of e (which may have led to divergences), 
but being careful to retain the dependence on loge. After that, we perform the angular 
integral (s) over the full interval from to 2n, and in the end integrate over the (rescaled) Aj. 



3.3 Third Moment 

Computing higher moments of the overlap distribution requires us to perform multiple an- 
gular integrals, while being careful to treat coincident boxes properly. This can again be 
formulated as finding the determinants of M and M, which for the Kth moment are K by 

10 More precisely, if we consider the resulting hypergeometric function for small e, but keeping c constant, 
the expansion we find is not uniform. It is the sum of a power series starting at O(e ) and another power 
series that contains a factor e 2c and starts at C(e 2c+1 ). The latter contains the singularity at large negative 
c, and we drop it, while the former describes the behavior at small c (it dominates for c > —1/2), and thus 
we keep its leading term. Remembering that c goes to — AiA 2 /loge it is clear that the physically interesting 
region is that of small c. 

Put differently, the terms that lead to divergences in the Ai integral are 0(e 1 ) with a prefactor e~ 2AlA2 , 
while the terms we are interested are present at zeroth order in the e expansion (with correction in powers 
of loge), but don't contain the offending exponential factor. 
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K matrices, with the off-diagonal matrix elements and depending on the angular 
separation 6i — 9j, again given by (15) and (14). 
Hence for K = 3 we have 



/ 



FT dX.dP, (W 123 ) 2 = - \ XQ - 1 - . (20) 

l\ y ' (87r 2 e)3 ,/detM x/detM 1 ' 



As before one can show that the momentum determinant contributes only a constant 
factor, in this case 2~ 3//2 e 3 . Rescaling the \ and taking the e — > limit then leads to the 
expression 

3 = _^L_ dgidggdgg /- dA!dA 2 dA 3 a?-a1-aI 
W ' (47r) 3 7o (2tt) 3 7-oc (-47r 3 loge) 3 / 2 

Ai A? A j A3 _ A 2 A3 



<9i - 6> 2 \ / . 2 e 1 -e 3 \ — A . 2 e 2 -e 3 



loge 



x(W^j (W (4 sin 2 . (21) 

While it is not obvious a priori how to compute this integral exactly for arbitrary expo- 
nents, we can try to use a procedure similar in spirit to the replica trick: find an appropriate 
function that agrees with this integral for integer exponents (in which case the integral can 
be performed explicitly) and then use this function to extrapolate to small values of the 
exponent. This is discussed in Appendix A.l. 

Using the result (57) from the appendix we find that the third moment is given by 



(-647r 4 loge) 3 / 2 V 81og 2 e 41og 3 e 640 log 4 e 
9vr 2 C(3) + 108C(5) ^„ 6 A 
+ 32Ve +O( ' 0g e) J ■ (22) 

Again this obeys the transcendentality principle, and one can easily compute further 
terms if one so desires. 



3.4 The Lowest Central Moments 

We have already noted that to leading order (g 2 ) = (q) 2 and thus the second central moment 
(variance) is actually of order log" 3 e: 

^ = =64^7 (s£n + iiS&i + 0(log " £) ) • (23) 

More surprisingly, computing the third central moment we find that the first two leading 
terms cancel out 

tf)^fa 3 )-3tf)fa> + 2fa^ ( _ 6 J loge)3/2 (J|- + ^- + (log-^)) . (24) 

This raises the question of whether the Kth central moments {q K ) c are all of 0(\og~ 3K ^ 2 e). 
If this is the case there might be a closed form expression for the leading overlap dis- 
tribution (i.e. capturing the leading term of each central moment) as a function of q ~ 
(log 3 / 2 e)(g — (q)), without explicit e dependence. 
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In fact, the above cancellations suggests that perhaps one should set up the calculation 
directly for central moments, rather than computing ordinary moments and subtracting. 
Certainly, if we were to use numerical methods this would be absolutely necessary, and 
we will see below that this also leads to substantial simplifications in the exact analytic 
computation. 



3.5 Fourth and Higher Moments 

The computation of higher moments is formally very similar to what we have discussed 
above. The crucial integral that needs to be evaluated to find the Kth moment of the 
overlap distribution is given by 11 



I 



pj^ipj ( 4 sm'V0 . (25) 



2-7T 

fc=0 Z/l j>i>l 



Knowledge of this integral for all positive integers K, even if only as an expansion for small 
e, would allow us to reconstruct the complete overlap distribution. 

However, already for K = 4 this is rather non-trivial. Appendix A. 2 describes the 
treatment analogous to the one that worked for K = 3, while Appendix A. 3 discusses what 
we get if we expand in e first. It is clear from these discussions that it would be difficult to 
directly compute arbitrary higher moments in this fashion, and we instead have to think of 
a smarter method to achieve this. 



4 Central Moments and Graphs 

From the above we know that the overlap distribution is approximately a delta-function 
centered on (q) = (— 647r 4 log e)' 1 / 2 , and that we should really compute central moments to 
learn more about its shape. 

4.1 Expansion in Terms of Complete Graphs 

The central moments are given by 

(Q K )c = ((? - (q)) K ) = E (f) (q n K-( q )) K - n (26) 

n=0 \ n J 

= (q K ) - K(q K -')(q) + \k{K - l)^" 2 )^ 2 - . . . + (-l)*" 1 ^ " l)(q) K ■ 

Every term in this expansion, when written out explicitly in terms of Wigner functionals 
as in the previous section, will boil down to a certain angular integral of the type discussed 
above, which we can represent symbolically by a graph. If draw a vertex labelled by i for the 
angular integral / d0i/(2ir) and an (undirected) edge between vertices i and j for each factor 
of (4 sin 2 -jtzJ)-*;-v,7ioge^ correS p 0nc is to the complete graph with K vertices (i.e. 

11 Essentially this amounts to studying a one-dimensional model of particles on a circle with a pairwise 
interaction potential given by the logarithm of the chordal distance, and arbitrary real- valued charges A,. 
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the graph in which each vertex is connected to every other one by exactly one undirected 
edge). We will denote this graph by Ck- 

The next term in the expansion above has a factor of (q), which corresponds to an integral 
over an angle that nothing depends on, which is represented by a vertex that has no edges 
attached to it. This gives simply a pure number, which after factoring out a suitable overall 
coefficient is unity. The factor (q K ~ 1 ) on the other hand again corresponds to a complete 
graph, but with only K — 1 vertices. There are K such terms, and we can think of these as 
arising from the K complete graphs Ck-i that are subgraphs of Ck- 

Similarly, all subsequent terms in the above expansion can be associated to complete 
graphs of fewer vertices, and the combinatorics is such that there is precisely one term of 
coefficient one or minus one for every complete graph that is a subgraph of Ck- Finally, the 
last term is just a number, (— 1) K ~ 1 (K — 1), indicating graphs with no edges, which we can 
think of as arising from K complete graphs with one vertex (singletons) and one graph with 
no vertices (the null graph). 

Why is this relevant? Naively the Kth. moment will be of 0(\og~ K ^ 2 e), but we have 
seen above that due to certain cancellations at the lowest central moments are actually 
parametrically smaller than that. The graphical representation can help us rewrite the 
computation in a way the makes these cancellations manifest for all K, and allows us to 
easily identify the leading pieces of the answer in the small e limit. 

4.2 Expansion with Parametrically Small Edge Factors 

We will illustrate the idea with the simple example of K = 3. If we introduce the shorthand 
(ij) = (4 sin 2 ^-AtAj/iogc f Qr foe angular factor associated to the edge between vertices 
i and j, and declare it as understood that all angles will be integrated over with measure 
Jq 77 d9i/(2n), and all parameters A« with measure dAj exp(— A 2 )/(v / 7r), we have 

(-647r 4 lo ge ) 3 /V) c = (12)(13)(23) - (12) - (13) - (23) + 2 (27) 
= [(12)-1][(13)-1][(23)-1] 

+ [(12) - 1][(13) - 1] + [(12) - 1][(23) - 1] + [(13) - 1][(23) - 1] . 

Here we have rewritten the polynomial in (ij) in terms of shifted variables, which we will 
denote by [ij] = (ij) — 1. While the (ij) are of 0(1), the factors of [ij] are of (9(log _1 e), 
which makes it obvious that the right hand side is actually of (9(log -3 e) and that the leading 
contribution comes entirely from the term cubic in the rectangular brackets. 

To see this, we merely have to consider the small e expansion of the [ij]. In the first 
(cubic) term we can expand each factor to first order in log -1 e and the leading term will be 
proportional to A^A^A 2 , which makes a non-vanishing contribution under the \ integrals. 
In the remaining (quadratic) terms however, we have to the expand each factor of [ij] to 
second order in log -1 e, since the first order terms are odd in some Aj and thus vanish. Hence 
the contribution of these terms is subleading and of order log -4 e. 

It pays to again think of this in terms of graphs, except that now the edge factors are 
equal to [ij]. The first (cubic) term corresponds to a triangle (namely C3), and its leading 
contribution comes from the shortest loop 12 we can draw on a triangle (namely going around 

12 In graph theory this would be called a closed walk, since the word loop is reserved for an edge that 
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the triangle once) which corresponds to expanding each factor to first order. The quadratic 
terms on the other hand correspond to graphs with only two edges, and they contribute only 
once we expand every edge to second order, which again corresponds to the shortest loop we 
can draw on such a graph, namely going along the two edges and then back again. However 
this loop is of length four, and thus subleading compared to the triangle, which has a loop 
of length three. 

This will be the general theme: we will write the result for (q K ) c a s a sum over terms 
labelled by certain graphs, and the contribution of each term will be suppressed by as many 
powers of log -1 e as the length shortest link (set of loops) we can draw on this graph. 

As a slightly more non-trivial example, consider K = 4. Again the expansion in terms of 
the (ij) contains the complete graph C 4 and all its complete subgraphs C 4 _ n with coefficient 
(— l) 4_n , accounting for multiplicities arising from the fact that we consider the vertices as 
labelled, i.e. distinguishable: 

(-647r 4 loge)V) c (28) 
= (12)(13)(14)(23)(24)(34) - (12)(13)(23) - (12)(14)(24) 
-(13)(14)(34) - (23)(24)(34) + (12) + (13) + (14) + (23) + (24) + (34) - 3 
= [12] [13] [14] [23] [24] [34] 

+ [13] [14] [23] [24] [34] + (five similar terms with five factors) 

+ [14] [23] [24] [34] + (fourteen similar terms with four factors) 

+ [14] [24] [34] + (fifteen similar terms with three factors and without closed loops) 

+ [12][34] + [13][24] + [14][23] . 

The [ij] expansion on the right hand side has a more interesting structure. It contains C 4 
which has six edges, all of its subgraphs with five edges and also every subgraph with four 
edges. Out of the 20 possible subgraphs with three edges, there are 16 present, and the four 
that are missing are precisely the complete graphs C3, i.e. triangles. Similarly, all subgraphs 
of these four C3 are absent and thus there no graphs with one or zero edges. The only graphs 
with two edges are the three that are not subgraphs of any of the C 3 . All those that appear 
do so with unit coefficient. 

In other words, the right hand side contains exactly one term for every subgraph of C 4 
which has at least one edge attached to every vertex (of C 4 ), i.e. has no isolated vertices. 

Which terms give the leading contribution in this case? It is clear that the shortest link 
we can draw on a graph with K vertices none of which are isolated, in such a way that every 
edge is traversed at least once, is of length L tot = K. In particular, if the expansion above 
contained C 3 , there would be loops of length L = 3 covering that graph, but since this is not 
the case we have to look for loops of length L — 4. Among the 15 graphs with four edges 
there are three containing a loop of length four. Furthermore, the last line contains three 
graphs which consists of only two edges. Each of those edges is completely disconnected 
from the rest of the graph, and we will refer to these as dimers. On a dimer we can draw 
a loop of length L = 2 by going back and forth along the same edge, and we will call such 
loops trivial. Thus the three graphs in question also support links of length L tot = 4, namely 

connects a vertex to itself. There are no such loops in our graphs, so hopefully this abuse of terminology 
will not be confusing. 
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Figure 1: The six graphs that contribute to the fourth central moment at leading order. For 
illustrative purposes we have chosen to arrange the four (distinguishable) vertices equidistantly on 
a circle, which can be thought of a constant time slice of dS2- 

two mutually disconnected trivial loops each. Altogether only these six graphs, shown in 
Figure 1, will contribute at the leading order (9(log~ 4 e), which makes it much simpler to 
extract the dominant terms in the fourth central moment. 

For general K we can easily write down the (ij) expansion. It turns out that the overall 
factor we need to extract is equal to (— 647r 4 loge)~ x / 2 . Thus we can write (q K ) c as a sum 
over terms labelled by complete subgraphs of Ck, including singleton and null graphs. 

n 

(-647r 4 loge)^ 2 (^) c = £ (-1) K ~* J[ (ij) . (29) 

complete graphs with n vertices C Ck 

Our proposal is that when expanded in terms of the [ij] this is equal to the much more 
useful expression 

(-647r 4 loge)*/ 2 <g*) c = £ J[ [ij] . (30) 

graphs 5 with K non— isolated vertices C Ck edges ij £Q 

We have checked explicitly that this holds also for K = 5. The equality of the two graph- 
ical expansions on the right hand sides of these equations should be a simple consequence of 
the relation [ij] = (ij) — 1 (independent of the precise nature of the edge factor). 13 After all, 
it just an algebraic statement that two polynomials are equal, and the graphs simply serve 
as a convenient way of labeling the terms in the two expansions. 

13 An iterative proof of this lemma for general K might proceed along the following lines: 
First one considers just the term corresponding to the complete graph with K vertices, i.e. the product of 
all \K{K— 1) round bracket edge factors Iljjeejr fe')' an< ^ ex P an ds this in terms of rectangular brackets. It 
is not hard to see that this expansion contains a term Yljj g g [ij] with unit coefficient for every subgraph Q of 
the completely graph Ck, including the complete graph itself. This is because multiplying out products of the 
(ij) — 1, in order to express everything in terms of round brackets, and summing over all subgraphs, only the 
desired term associated with the complete graph survives. All other terms come with vanishing coefficient, 
since for terms with r edges removed from a total of ^K(K-l) we have X)p=o( — ^-) v (* K ^ ) (^ K ^~ 1 ^ p ) = 
0. 

Now in the round bracket expansion (29) we are interested in, terms corresponding to the K complete 
(sub)graphs with K — 1 vertices are subtracted from this. We can use the same identity again for each 
of them to express this as a rectangular bracket expansion containing terms with unit coefficient for every 
subgraph of Ck—i- Subtracting these corresponds to removing all terms that have exactly one isolated vertex 
(namely the one not contained in Ck-x) from the rectangular expansion. 

However, terms that have two isolated vertices have now been subtracted twice, so we compensate for this 
we add to the round bracket expansion the complete subgraphs Ck-2 that do not contain these vertices, 
and so on until we get down to the singleton and null graphs. We end up with a round bracket expansion 
that is an alternating sum of complete (sub) graphs versus a rectangular bracket expansion that contains all 
subgraphs except those with isolated vertices. 
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4.3 The Contribution of Cycle Graphs 



Why is the latter expression (30) so much more useful? We have already seen that it makes 
it obvious that the Kth central moment will in fact be 0(\og~ 3K ^ 2 e), and allows us to easily 
identify a small subset of terms that contribute to the leading order. This is because the 
only graphs that contribute at this order are those that can be covered by links of length 
L tot = K (i.e. on which we can draw a set of loops using every edge of the graph at least 
once such that in total the number of steps does not exceed the number of vertices). It is 
easy to convince oneself that the relevant graphs are always composed of a set of (mutually 
disconnected) cycle graphs (in which every vertex has degree two) and dimers (two vertices 
of degree one joined by a single edge). 

Not only are these graphs easy to identify, but we can also calculate their contribution 
explicitly. For a cycle graph this is much simpler than for a large connected graph (which 
we would have to compute in order to find the Kth moment directly), since every vertex is 
connected only to two instead of K — 1 other vertices. 

Using the Fourier expansion of the logarithm of (ij) given in (59) we find 



I 
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J2 2 cos(n(^_! - 6 i+1 )) + 0(log- 3 e) . 



71=1 



n 



(31) 



Concatenating further edge factors with this expression is trivial, since the result is self- 
similar - it simply leads to further factors of log -1 e and increases the power of n in 
the denominator that multiplies the cosine of the difference between the first and the last 
angle. Once we close the loop after L steps, there will be precisely two powers of \ for every 
vertex we passed through, and the sum of n~ L will result in a zeta value ((L). Thus if we 
identify 9 L = 9 and A^ = A we find for L > 3 



m 

i=i 



i,i] 




4sin' 



(21oge)- i 2C(L) + 0(log- L - i e ) . 



1 



(32) 



For trivial loops on dimers (L = 2) the same result holds, except that there is an extra 
factor of 1/2 since in this case we use the same edge twice and therefore have to expand to 
second order. 

Thus a loop of length L contributes a factor proportional to C(-^)- F° r a graph con- 
sisting of a number of disconnected cycles and dimers we simply multiply the contribution 
from each component. Since the total length of all loops in the link has to add up to K, 
this nicely confirms that the leading term of the (rescaled) Kth central moment will have 
transcendentality K. 



4.4 Counting Graphs 

We have identified the graphs that contribute to the leading term of the central moments, 
and calculated the individual contributions of such graphs. What remains to be done is to 
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count how many graphs will contribute for a given K. We will do this in two steps. First 
we need to enumerate the different families of graphs that are relevant, and then count the 
number of graphs within each given family. 

We distinguish families simply according to how many (mutually disconnected) loops of 
length L are present in the graph. We can characterize this by a set of integers itll that give 
the multiplicities of loops of a given length. In particular, m 2 gives the number of dimers, 
and the uil for L > 2 count the number of cycles of length L contained in the graph. We 
have already argued above that the graphs relevant to the leading central moments must 
obey the constraint 

]T m L L = K . (33) 

L>2 

In other words, the families of graphs relevant for the Kth central moment are labelled 
by the integer partitions of K with the additional constraint that each summand has to be 
at least two. 14 We will denote the space of such integer partitions by P(2,K). 

How many graphs are there in a given family? If there are cycles of a given length 
L we can begin by choosing m L indistinguishable sets of L out of K vertices, which can be 
done in 

'K - (m L - 1)L\ K\ 




"V L J m L \ (L\)™l (K - m L L)\ 

ways. E.g. we can choose to start with L = 2 and then pick m 3 indistinguishable sets of three 
vertices out of the remaining K — 2m 2 vertices and so on, which results simply in a product 
of factors of the form given above with K replaced by the number of vertices remaining after 
each step. 

Once we have picked sets of vertices we have to count how many different ways there are 
to make a cycle out of L vertices. It easy to see that this is L\ modulo discrete rotations and 
orientation reversal, i.e. (L — l)!/2. 

Again, the L = 2 dimer is a special case, since there is clearly exactly one way of 
connecting to vertices with one edge, i.e. the formula for cycles must be amended by an 
extra factor of 2. 

In summary, for a given integer partition itll G P(2,K), which specifies the family, the 
number of different graphs is given by 



Kl 2 m2 TT ulr, ■ (35) 



4.5 Explicit Expressions for Central Moments 

Putting everything together, the leading central moments that follow from (30) by summing 
over all families of graphs labelled by integer partitions, making contributions (32) with 
multiplicities (35), are given by 

(^) c = (-64 7 r 4 log6)-^ 2 (21og6)-^! £ ]J -±- + (log-* K ^ e) . (36) 

m L eP(2,K) L>2 m L- \ L J 
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This is known as an intermediate function in number theory. 
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Happily, the extra factors of 1/2 from having to expand to second order for dimers precisely 
cancel the extra factors of 2 from counting graphs with dimers, so that the L = 2 case 
appears no different than L > 3 in the end. 

This expression agrees with the leading terms in (q 2 ) c and (q 3 ) c found above, and gives 
the following results for the subsequent central moments 

(q 4 ) c = (-2567T 4 log 3 e)~ 2 ^ + 0(\og~ 7 e) , (37) 
(g 5 ) c = (-2567T 4 log 3 e)^ 2 (^C(3) + 24C(5)) + 0(log- 17 / 2 e) , (38) 

(q 6 ) c = (-2567T 4 log 3 6)" 3 ( — + 40C(3) 2 J + 0(log- 10 e) , (39) 

and so on. Here we have made use of the fact that zeta functions of even argument are 
expressible as rational numbers times powers of ir. As remarked above the central moments 
have uniform transcendentality and are C*(log~ 3K / 2 e). This motivates us to define a new 
variable 

Q = (-2567T 4 log 3 e) 1/1,2 (<? — (q)) , (40) 
in terms of which the overlap distribution will be smooth in the small e limit. 



4.6 Characteristic Function and Overlap Distribution 

Knowing all central moments (at least to leading order) we would now like to reconstruct 
the overlap distribution. This is best done via the characteristic function, which is the 
expectation value of a complex exponential with frequency oo. 

Our result (36) involves a sum over integer partitions, so it is helpful to know that the 
generating function of (unrestricted) partitions is given by ]1lLi(1 — x 1 )^ 1 . We can easily 
modify this for our restricted case in which each summand has to be at least two, by simply 
omitting the L = 1 factor. However, this is still not exactly what we need, since we are 
interested in integer partitions that come with additional factors of 1/tol! depending on the 
multiplicities with which the integers L appear. Fortunately for us, this case is even easier: 
the generating function is simply a product of exponentials, and thus 

<^>=E<^=nex P (M^>) 

K=0 L>2 \ ^ J 

= e- 1 ^ T(l - iuo) , (41) 

where we have used Y j l>2{'^ uj ) L C{L) / L = — ryco>+log(r(l— iu)) and 7 is the Euler-Mascheroni 
constant. 

Using the integral representation of the T-function we can then Fourier transform the 
characteristic function to obtain the original overlap distribution for the (shifted and rescaled) 
variable q: 

\ f OO P OO 

V(q) = — due-^(e ! ^)= dt e~* 5(q + 7 + logt) 

2lT J -00 X ' Jo 

= exp(-g - 7 - e-^) . (42) 
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This is known as a Gumbel or log-Weibull distribution (see Figure 3 below). It is easy 
to check that it is correctly normalized and reproduces the leading terms of all the central 
moments we have computed above. 

Of course we have assumed here that the domain of V(q) is the whole real line. While 
this is not strictly true, it is not a problem in the small e limit we are interested in, since 
q is a rescaled variable, and in terms of the original variable q the distribution looks highly 
compressed (almost like a 5-function). The distribution falls off exponentially in the positive 
direction, and even faster (as a double exponential) in the negative direction, so any tails 
outside the original domain of V(q) are highly suppressed for small e. 



We will now turn to a finer probe of the structure of the space of states, namely the triple 
overlap distribution. Here we consider three copies (replicas) of the system under consider- 
ation, and compute the multivariate distribution of mutual overlaps q a b between them. In 
order to avoid an overabundance of indices we will use dual variables q a = \e a b c \ qbc/2, and 
similarly for other quantities. 

5.1 Moments in Terms of Wigner Functions 

We can express the moments of this distribution in terms of coarse-grained Wigner functions 
as follows: 



where there are three groups of degrees of freedom 15 such that K = K l + K 2 + K 3 . For 
simplicity we have chosen a particular ordering of the K degrees of freedom, which makes very 
explicit that every set of variables (9i,Xi, Pi) for a given index i is associated with exactly two 
of the three replicas (but of course any permutation would be just as good). Reconstructing 
the full triple overlap distribution from its moments will allow us to determine, amongst 
other features of the space of states, to what extent the system exhibits ultrametricity. 

We can use the representation (12) for the Wigner functionals, where the parameters \ k 
and Afc should be thought of as carrying an additional index to indicate which replica they 
belong to. As above, however, performing the integral over the coarse-grained expectation 
values Xi and corresponding momenta Pi leads to delta functions that allow us to eliminate all 
but one complete set of (A&, A&) parameters, such that the extra index essentially disappears. 

To illustrate this in more detail, let's assume that the momentum integral again only 
contributes terms suppressed by positive powers of e, and rescale the A& as above, so that 

15 Note that the K a are dual variables corresponding to the more natural K a b, whereas the Wigner distri- 
butions have only one index to begin with since they are associated with one replica only. 



5 Triple Overlap 




l,2,...,K 1 ,K 1 +K 2 +l,K 1 +K 2 +2...,K 




(43) 
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Figure 2: Illustration of the triple overlap computation. On the left the three copies of the Wigner 
distribution are shown, with all edge factors that appear in them. One the right, we superimpose 
the three pictures, which corresponds to the situation after integrating over the coarse-grained 
variables (Xj,Pj) and eliminating all (Aj,Aj) parameters that appear in ^-functions. Edges within 
one group appear twice on the left, and after being identified are drawn as thick lines on the right. 
Edges that stretch between two groups appear only once, and are suppressed by a factor of — |. 



up to numerical factors the Wigner distribution effectively can be written as 

w i,2,3...k^ f[(d\«e-W 2 / 2 e 2 * a ° x i) f[ Usin 2 ^^ . (44) 

J -°° i=i k>j>i V 2 / 

Multiplying three such factors and integrating over the (for simplicity also rescaled) X[ 
will set sums of two A" with the same % but different replica index equal to zero. Integrating 
over those delta functions, and choosing the remaining set of parameters \ in a symmetric 
fashion, the resulting formula for the moments of the triple overlap distribution reduces to 
basically the same expression (25) we have already encountered. 

The overall K is now interpreted as the sum of the K a , and the left-over parameters Aj 
are integrated over with Gaussian weights as before, except for one crucial difference: the 
angular factors connecting two degrees of freedom in different groups have an extra factor of 
— | in the exponent. This is clear, since as shown in Figure 2, those factors appear only in 
one of the three Wigner distributions (rather than in two of them such as those connecting 
degrees of freedom in the same group). The most obvious choice of parameters is to define 
e.g. Aj = A 2 = — Af for i = 1,2, . . . ,Ki, which introduces minus signs whenever these is a 
connection between the first and the second group. Similarity transformations, leading to 
less symmetric choices of parameters, can eliminate some, but not all of these minus signs. 

To put it differently, if we carry out the Gaussian Aj integrals first and think of the 
problem as computing the (inverse square root of the) determinant of a K by K matrix My, 
the crucial difference in the structure of the relevant matrix compared to the My considered 
above is that block off-diagonal elements are multiplied by — |, whereas entries in the three 
K a by K a blocks on the diagonal are the same as before. This is very reminiscent of the 
form of the overlap matrix in a system with one-step replica symmetry breaking (even though 
there appears to be no direct physical relation). 
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5.2 Computing Moments Using Graphical Techniques 

Since the expressions for the moments of the distribution are so similar, the calculations of 
Section 3 translate immediately into results for the first few moments of the triple overlap 
distribution. This will not take us very far however, so we'll turn directly to applying the 
method of Section 4 to this problem. 

We have learned that we should really be computing the central moments 

(q^€ 2 € 3 ) = (-2567r 4 log 3 6)^ 2 ((q, - ( qi })^(q 2 - (g 2 ))^(g 3 - (g 3 »* 3 ) ■ (45) 

As above, we will associate to each of these central moments a set of graphs, with K 
vertices labelled by the index i, and edges corresponding to the angular factors connecting 
two vertices (i.e. depending on the difference of two degrees of freedom). However, now 
there is additional structure to the problem: the vertices are split up into three mutually 
exclusive groups, with the number in each group given by K a , and the edges come in two 
varieties, namely those connecting two vertices within one group, and those connecting 
different groups. 

The arguments of Subsection 4.2 go through as before, since they don't rely on the exact 
form of the edge factors, and thus we can write the leading term of each central moment 
as a sum of terms labelled by graphs with K non-isolated vertices on which minimal length 
links can be drawn. 

The contribution of each cycle or dimer contained within such a graph also follows easily 
from the calculation in Subsection 4.3. We simply need to insert appropriate factors of — \ 
whenever we cross the boundary between different groups of vertices. 

However, we face a rather nontrivial counting problem: how many (unoriented) graphs 
with K vertices are there which consist of disconnected cycle or dimer components (sub- 
graphs), in which each vertex is part of exactly one disconnect component, and also belongs 
to exactly one of three distinguishable groups with K a members, such that the boundaries 
between groups are crossed a given number of times? 

5.3 More Graph Combinatorics 

Of course we are not really interested in the number of graphs as such, but rather would like 
to compute a weighted sum over all of them with the appropriate factors attached to each 
one, and thus we need a complete classification of the graphs that contribute to the central 
moments at leading order. We will do this in several steps. 

At the highest level, the relevant graphs are characterized by the family they belong 
to. As discussed in the previous section, we specify the family by an integer partition of K 
written qsitll G P(2, K) (which each summand at least two), which counts the multiplicities 
of loops (or dimers) of length L. We know from the above that within each family there are 
K\2 m2 Y\ L>2 (mL\)~ 1 (2L)~ TnL distinct graphs. We obtained this by first counting the distin- 
guishable ways of assigning each of the K vertices to one of the loops, and then multiplying 
by the number of ways we can string together (wire up) a given set of L vertices into a loop 
of length L. 

For the present problem however, we need a finer classification, since vertices are now 
assigned to one of three groups and we need to consider whether edges cross between different 
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groups. Therefore, when assigning vertices to loops of a certain length L we have to consider 
how many vertices to take from each of the three groups. The number of ways of doing this 
is given by 



X £ ( fit) Lm 2 L ) ) SW L + nT + nT ~ L) 



1 v v K X \K 2 \K Z \ 



n[...n 1 ^=0 n! 2 ...n 2 ^=0 



X {Ki - EjTii nf)!(tf 2 - £™ii rtf)!(*s - £™ML - nf - n p 2 ))\ ' (46) 

The factor of l/ra^! appears because loops of the same length are interchangeable at this 
level. For each loop there is obviously a constraint that the numbers of the vertices drawn 
from the three groups have to add up to L. When we go through all the possible values of L 
we will multiply factors of this type until all vertices are assigned. Thus, when we are given 
a set of integers {K 1: K 2 , K 3 } there are also global constraints which impose that the total 
number of vertices drawn from each group has to add up to the correct K a . Demanding this, 
the total number of possible vertex assignments is 

oo oo 

L,v L \ r/ t/- L,v L \ 



£ E K ± \K 2 \K 3 \ 5{K X - ]T n ? VL ) 5(K 2 - £ n L 2 ' VL ) 

{ni'^}=0 {n^}=0 



1 mL 1 
TT — TT (47) 
l>2 UIl - ii L =\ n \ - n 2 '■{-L' — n 1 — n 2 y. 



where there are parameters n L '^ L appearing inside the sums (and products) for every L for 
which m L > and /x^ = 1, 2 . . . m^. 

Thus for given K a we can define a genus (think biology, not topology) of graphs belonging 
to a given family by a set of pairs of integers , n 2 ,/lL } where pairs associated with 

the same length L but different labels fx L are interchangeable (i.e. sets differing only by 
swapping pairs with the same L are considered equivalent). A valid set has to satisfy the local 
constraints n\ ,flL + n 2 ' pL < L for all L and fi^ and the global constraints El,u l n^ ,flL = K a . 

However, even within a given genus there are graphs with different factors associated 
with them, because even once we have assigned vertices to loops of given length there are 
still different ways of wiring up the vertices into a cycle, which may cross the boundaries 
between groups a different number of times. As a simple example, consider a cycle of length 
four for which we have chosen two vertices from one group and two vertices from another. 
Clearly, we can draw loops of length four that cross between the groups either twice or four 
times (as is apparent from the three leftmost graphs in Figure 1, considering say the top two 
vertices as belonging to a different group than the bottom two). 

Thus within a given family and genus, we can think of a species of graphs as a subset 
which for each cycle it contains has a specified number of edges between vertices belonging 
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to different groups, each of which picks up a factor if — | relative to edges within one group. 
We define for each cycle a loop factor £ L (ni,n 2 ) that sums over all possible cycle graphs 
within a given genus and weighs each species by the appropriate power of — \ (taking into 
account the correct population of each species). In the example of the previous paragraph, 
we can draw two different graphs that cross boundaries between groups twice and one graph 
that has four crossings. Therefore for this loop the correct factor would be £ 4 (ni = 2,n 2 = 
2) = 2(— |) 2 + 1(— t,) 4 = ^7. The total loop factor will simply be a product of such factors 
for each cycle (or dimer) contained in the graph. 

In summary, in order to compute a given moment with exponents K a all we need to 
do is sum over all relevant families, genera and species of graphs with zeta function values 
depending on the family, and loop factors taking care of the numerical suppression associated 
with edges crossing borders as well as multiplicities within each species. To execute this we 
would have to overcome two difficulties: firstly we would need an explicit formula for the 
loop factors C L (ni, n 2 ), and secondly we would have to find an efficient way of implementing 
the global constraints when summing over genera. Fortunately, it is not necessary to perform 
either of these steps explicitly. 



5.4 Characteristic Function from Matrix Powers 

An elegant way to avoid the ugly details that arise when computing individual moments in 
this fashion is to work with generating functions. Consider the matrix 

u\ —ui/2 — U\/2 S 

Vl= \ u 2 II -1/2 1 -1/2 I = I -u 2 /2 u 2 -U2/2 I . (48) 
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<j0 2 


w 3 ) ( 











-W3/2 -W3/2 UJ 3 
It turns out that it generates precisely the loop factors we require 

Intuitively, the matrix powers of Q encode the fact that there is a penalty for going from 
on group to another (in the off-diagonal elements of Q), while at the same time taking into 
account all the different paths that visit each vertex once. The trace makes sure that the 
path closes in the end. Again, the dimer case is special and gives rise to an extra factor of 
two. 

The above expression, which gives us a linear combination of all loop factors for a given 
L, rather than just a particular one, is precisely what we need to compute the characteristic 
function of the triple overlap distribution. This also avoids the second problem of having to 
impose global constraints: since we have to sum over all K a , the issue no longer arises. 

Neglecting all subleading terms (suppressed by additional inverse powers of loge) the 
characteristic function is then given by 

(exp(«wigi + tu 2 q 2 + ^3^3)) = 2^ K \ K \ K \ ^ ?2 ^ ' 

K u K 2 ,K 3 =0 A1./V2./V3. 
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= E E E E E II - E 

K u K 2 ,K a =0m L eP(2,K) {n ^ L}=0 {n ^L }=0 {n ^L }=0 \o=l 

^exp^E^tr^j . (50) 

Again, the extra factors of two for the dimer cancel, and performing the sums over the K a 
first, then summing over the n^ ,Mi using (49) and finally using the generating function for 
integer partitions, the expression simplifies dramatically to a non-Abelian version of (41). 

5.5 A Tale of Two Gumbels 

The non-zero eigenvalues of Q are given by 

W± = -{Ui + U 2 + 0J 3 ) ± ^ ^Juf + Uj\ + w| - WiW 2 - ^1^3 - ^2^3 , (51) 

and therefore the trace of Q L is simply equal to uj\ + u L . 

If we introduce cylindrical polar coordinates aligned with the equilateral axis 

u z = + w 2 + w 3 ) , 



= ^V^l + W 2 + w 3 - ^1^2 - ^1^3 - ^2^3 , 



w v = arctan f 1 —\ , (52) 

and similarly for the dual coordinates (q z , q r , q v ), we find that the characteristic function can 
be written as 

i Ux q x +i Ur q r <x*( Uv -q v )\ = g-^^p^ _ ^^(j _ ? (53) 



e" 

where we have performed the sum over L explicitly, and cu± = (y/3/2)(u z ± tu r /\/2). 

The characteristic function of the triple overlap distribution is independent of the angular 
variable u v . In fact, it is simply a product of the characteristic functions of two Gumbel 
distributions, and thus its Fourier transform will lead to an expression resembling a convolu- 
tion (though not exactly, because the u± integration region is only a half-plane). Performing 
this Fourier transform we obtain, using again the integral representation of the T-function 

V(Qi, 92, qs) = j^y, J duAurd^ co r e" 1 ^ (54) 

f°° dse~ s /°° ( ite _ *e _iv/5(logs+log * )tJz/2 e~ iv/5(logs ~ log * )cJr/(2v ^ ) . 
Jo Jo 
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Figure 3: Overlap distribution V(q) (left), and triple overlap distribution V{q z ,q r ) (right). 



Since the integrand is independent of u v except for the complex exponential of u.q, the 
Up integral will simply pick out the lowest order Bessel function of the angular expansion 
of this factor, and the result will be independent of q v . The u z integral leads to a delta 
function which trivializes one of the parametric integrals. After an appropriate change of 
variables, the second parametric integral then takes the form of the integral representation 
of the modified Bessel function of the second kind 

rod 

KJx)= due- xcoshu cosh(vu) , (55) 
Jo 

but with purely imaginary order v. This leads to the following expression for the triple 
overlap distribution 

V(q z , q r ) = ^= / o °° du r u r J (u r q r ) e~ 2 ^^ K lV ~ 3uJr/V - 2 ^e"^/^) . (56) 

This peculiar type of integral, with the integration variable appearing in the order of the 
modified Bessel function, is known as a Kontorovich-Lebedev transform (or its inverse, de- 
pending on conventions). One can check that this probability density is correctly normalized, 
and that the first moment (q z ) vanishes 16 as required, though for the computation of higher 
moments it is more convenient to directly expand the characteristic function (53) instead. 

6 Discussion 

We have introduced a neat graph-based technique that allowed us to explicitly compute the 
overlap and triple overlap distributions for a coarse-grained, massless scalar field on a 1+1 
dimensional de Sitter background by largely combinatorial methods. 

What have we learned from these calculations? The obvious answer, which was of course 
entirely expected, is that in the naive late time e — > limit, in which the number of causally 
disconnected boxes in our model universe diverges, the overlap of a scalar field in (IS2 tends 
to vanish (i.e. the limiting distribution is a delta function centered on zero). This simply 

16 This might not be apparent from the plot of the probability density, which for small q r is clearly peaked 
in a region of negative q z . Recall however that the integration measure contains another factor of q r , which 
mitigates this, and enhances the tail of the distribution which is more pronounced for positive q z . 
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confirms our intuition that in this limit the convolution with the box function merely picks 
out a perfectly localized harmonic oscillator degree of freedom, and since the underlying 
field theory is free there is no reason to expect anything non-trivial to emerge from such a 
collection of harmonic oscillators. 

However, it is a remarkable fact that if we look more closely, subtract the mean and 
scale the central moments of the overlap distribution appropriately, we find a rather non- 
trivial shape for the overlap, which is given by a Gumbel distribution (42). Thus we have 
obtained a precise characterization of how likely deviations from the above trivial behavior 
are. Whether we accept this as evidence for a non-trivial phase structure, depends primarily 
on whether or not we consider the rescaled distribution physically relevant. 

There may be good, physical reasons to perform this shift and rescaling, since in a sense 
it merely removes the obvious effect of an inflating universe, which is the tendency to dilute 
overlaps between states on an absolute scale, maybe not unlike the situation in a spin glass 
to which we keep adding spins according to some rule. We will not speculate on this further, 
except to note that the rule in this analogy is by no means random; the rescaling of the 
overlap we had to perform is merely by powers of — log e, whereas the number of causally 
disconnected boxes grows as e _1 . 

Interestingly, the Gumbel density is also the distribution of a much simpler quantity, 
namely the (regularized and zero mode subtracted) self-overlap / d6\(p(6)\ 2 , as explained in 
[8] . A priori there was no reason to expect this heuristic notion of overlap to agree with the 
full phase-space definition (certainly we wouldn't in general), and this should presumably 
be taken as a further testament to the simplicity of the system we are studying. 

We have also obtained an explicit expression for the triple overlap distribution (56) which 
adds this system to a rather short list of models in the literature whose triple overlaps are 
known in closed form. 

Again, we could consider first the naive e — > limit, in which this distribution approaches 
a delta function supported on equilateral triangles (with all three q a equal to each other), 
whose size tends to zero as (— loge)^ 1 / 2 . While this limiting distribution is technically 
ultrametric, its ultrametricity is of a trivial nature, since as we have discussed above, in the 
same limit the regular overlap distribution also approaches a delta function, precluding any 
interesting phase structure. Furthermore, in a non-trivial tree the pairwise path distances 
between three arbitrary leaves should be allowed to take different values (only the largest 
two, but not necessarily all three of them should have to be equal). 

However, the appropriately shifted and scaled triple overlap distribution is perfectly 
smooth, as shown in Figure 3, and given by a rather intriguing Kontorovich-Lebedev integral 
(56). Still, it is invariant under rotations around the equilateral (i.e. z) axis, which implies 
that isosceles triangles are not particularly preferred. If there were a non-trivial tree structure 
underlying these vacuum fluctuations, we wouldn't expect the triple overlap distribution to 
have support everywhere around the equilateral axis, but only on the three half-planes 
described by q\ = qi > q^ and permutations thereof. Thus while the rescaled triple overlap 
distribution certainly has a very interesting structure indicative of ergodicity breaking, we 
do not see evidence for a hierarchical organization of the state space 17 . 

17 The expression (53) was subsequently rederived as the triple distance distribution of a different, simpler 
measure of overlap in [8], where it was used to argue in favor of ultrametricity of the model. This was based 
on the observation that if one restricts to rare events in the tail of the distribution, the conditional probability 
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This result might be a limitation forced upon us by the low dimensionality and simple 
dynamics of our model. Higher dimensional de Sitter spaces may exhibit a richer structure, 
and we hope that the calculations presented here can be extended, at least at some level of 
approximation, to these cases. Other possible generalizations include introducing masses and 
self-interactions, or considering non-scalar fields, and we hope that the study of such systems 
will reveal further interesting facts about the statistical properties of inflating universes at 
the largest scales. 
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A Some (not so) Easy Pieces 

Here we collect some expressions that are relevant for the preliminary computations of Sec- 
tion 3 and might potentially lead to an alternative, simpler derivation of the main results of 
this paper (and generalizations thereof). 

A.l Angular Integral for K = 3 

Let's introduce the short-hand A^ = — loge. We need to evaluate the angular integral 
in (21), which in general is not trivial. Even though we are really interested in small values 
of the Xij, we can proceed by computing the integral for integer values of the Ay first. This 
is simpler, since it allows a binomial expansion of the integrand, with enables us to express 
the answer as a multiple sum with constraints implemented by Kronecker deltas. Analytic 
continuation will then give us the answer for general values of the parameters. 

'P(Qi\q , 2i Q3) is peaked in the region where the three q's form isosceles triangles. However, while intriguing 
in its own right, this approximate localization property is clearly much weaker than ultrametricity, whose 
definition requires a sharp localization on a two-dimensional surface. Furthermore, it is really a property 
of a particular conditional distribution, rather than of the joint triple overlap distribution, since it depends 
crucially on the choice of conditions (which amounts to specifying a scheme for comparing isosceles triangles 
with others that would not be present in an ultramctric distribution). In this case the conditions imposed 
explicitly break the rotational symmetry of the joint distribution about the line of equilateral triangles, which 
maps isosceles triangles to non-isosceles ones. On the other hand, if we looked at a conditional distribution 
that respects this symmetry, e.g. by keeping q z (i.e. the sum of the three sides of the triangle) constant, we 
would find no such approximate localization. 
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(2tt) 3 ^ /2A 12 \ ^ /2A 13 \ ^ (2X 

( _4)a 12+ a 13+ a 2 3 2^ [ ki2 ) y ki3 ) ^ h 



x (-l) fc -+^+/c 23 5(A 12 - fc 12 + A 13 - fci 3 ) S(-X 12 + k 12 + A 23 - A; 23 ) 



2.3 



^23 



(2tt) 3 (-l) fc, (2A 12 )!(2A 13 )!(2A 23 )! 

4 a 12+ a 13+ a 23 (Ai2 _ fc , )!(Ai2 + fc , )!(Ai3 _ fc/)!(Ai3 + fc/)!(A23 _ fc/)!(A23 + k ,y 

= g7r3 / 2 r(| + A 12 )r(| + A 13 )r(| + A 23 )r(i + a 12 + a 13 + a 23 ) 

r(i + Ai 2 + A 13 )r(i + a 12 + A 23 )r(i + Ai 3 + a 23 ) 1 } 

If we had performed the computation for general values of the from the start, the prin- 
cipal differences we would have encountered are generalized binomial sums that no longer 
terminate, and Kronecker deltas being replaced by sinc-functions. 



A. 2 Angular Integral for K = 4 

For the angular integral relevant to the fourth moment we would like to use the same trick as 
in appendix A.l above, namely compute for integer exponents and then analytically continue 
to small values of the \j. In this case there are six angular factors, and four integrals, which 
lead to three independent Kronecker deltas. Proceeding exactly as for the K — 3 case we 
find 

rntid^n^^sin 2 ^)'^ (58) 

(2^ V- (-l) fc, i2+ fc i3+ fc 23(2A 12 )!(2A 1 3)!(2A 1 4)!(2A 23 )!(2A 2 4)!(2A34)! 

4A 12 +a 1 3 + a 14+ a 23+ a 24+ a 34 l^k' 12 ,k' 13 ,k' 23 (A 12 -fe' 12 )!(Ai2+fc; 2 )!(Ai 3 -fci3)!(Ai3+fc' 13 )!(A 23 -fc^)!(A 23 +fc^)! 

x 1 

(Al4-tf 12 -ki3)!(Al4+tf 12 +ki 3 )!(A24-tf 12 +^s)l(^ ' 

Unfortunately, we do not know how to compute this triple sum in closed form, and with- 
out such an expression we cannot analytically continue to small values of the \j. Rewriting 
the sums as contour integrals does also not seem to lead to an expression that readily lends 
itself to contour deformation and analytic continuation. 



A. 3 A Modest Proposal 

Of course, since we really care about small exponents we could be more modest and simply 
calculate the relevant integrals as a power series around \j = 0. If we expand in \j first we 
will have to compute integrals with multiple factors of 

/ /\f)\ °°/9\ 
log Usin 2 — =E cosnAtf, (59) 



2 / V n 



where we have indicated the Fourier series of this expression, which will be useful below. 
Let us illustrate the procedure with the simplest case of K = 2 



L 



.2,r / Af)\ X /-27T 00 \ ™ m 00 / 9 \ 

f dM(W-) =/ o dA^-n£(--)co Sn ,Ae 



= 2«t t T±---± E ^ (± " 1± " 2 "- ± "- ) . (60) 

m=0 m - ni=l n m =l 2 m sign combinations 71^2 ■ ■ ■ n m 
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We have to sum over an m-dimensional integer lattice with a constraint imposed by a Kro- 
necker delta. The coefficient at 0(\ m ) should be equal to C( m ) times a rational number. 

This method generalizes straightforwardly to higher K: in general the number of rii 
factors in the denominator is equal to the total exponent of the XijS and there are K — 1 
constraints, again summed over all possible sign combinations. If we could efficiently perform 
such constraint (^-function type lattice sums we would have solved the problem, but in the 
absence of good technique to achieve this it is not clear if modesty is really advantageous in 
this case. 
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